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In Bayesian nonparametric inference, random discrete probability measures are commonly used 
as priors within hierarchical mixture models for density estimation and for inference on the 
clustering of the data. Recently, it has been shown that they can also be exploited in species 
sampling problems: indeed they are natural tools for modeling the random proportions of species 
within a population thus allowing for inference on various quantities of statistical interest. For 
applications that involve large samples, the exact evaluation of the corresponding estimators 
becomes impracticable and, therefore, asymptotic approximations are sought. In the present 
paper, we study the limiting behaviour of the number of new species to be observed from 
further sampling, conditional on observed data, assuming the observations are exchangeable and 
directed by a normalized generalized gamma process prior. Such an asymptotic study highlights a 
connection between the normalized generalized gamma process and the two-parameter Poisson- 
Dirichlet process that was previously known only in the unconditional case. 
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generalized gamma process; polynomially and exponentially tilted random variables; 
a-diversity; species sampling models; two parameter Poisson-Dirichlet process 

1. Introduction 

In species sampling problems, one is interested in the species composition of a certain 
population (of plants, animals, genes, etc.) containing an unknown number of species and 
only a sample drawn from it is available. The relevance of such problems in ecology, biol- 
ogy and, more recently, in genomics and bioinformatics is not surprising. From an infer- 
ential perspective, one is willing to use available data in order to evaluate some quantities 
of practical interest. The available data specifically consist of a so-called basic sample of 
size n, (Xi, . . . , X n ), which exhibits K n G {1, . . . , n} distinct species, {X*, . . . ,X^ ), with 
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respective frequencies (Ni, . . . , Njc n ), where clearly J2i=i Ni = n. Given a basic sample, 
interest mainly lies in estimating the number of new species, '■= K m+n — K n , to be 
observed in an additional sample (X n+ i, . . . ,X n+m ) of size m and not included among 
the X*'s, j = l,...,K n . 

Most of the contributions in the literature that address this issue rely on a frequentist 
approach (see [4, 6] for reviews) and only recently an alternative Bayesian nonparametric 
approach has been set forth (see, e.g., [10, 12, 23, 26]). The latter resorts to a general class 
of discrete random probability measures, termed species sampling models and introduced 
by J. Pitman in [32]. Given a nonatomic probability measure Po on some complete and 
separable metric space X, endowed with the Borel cr-field 5£ , a (proper) species sampling 
model on (X, 3y) is a random probability measure 

i>l 

where (li)i>i is a sequence of independent and identically distributed (i.i.d.) random 
elements taking values in X and with probability distribution Po i the nonnegative random 
weights (pi)i>i are independent from (ii)i>i and are such that Si>iPi = 1> almost 
surely. In the species sampling context, the Y^s act as species tags and pi is the random 
proportion with which the ith species is present in the population. If (X n ) n >i is an 
exchangeable sequence directed by a species sampling model p 7 that is, for every n > 1 
and Ax, ... , A n in 3£ one has 

n 

P[Xj e A t , . . .,X n g A n \p] =J{p{Ai) (1.1) 

i=l 

almost surely, then (X n ) n >i is termed species sampling sequence. Besides being an effec- 
tive tool for statistical inference, species sampling models have an appealing structural 
property established in [32]. Indeed, if (X n ) n >i is a species sampling sequence, then there 
exists a collection of nonnegative weights {jPj, n {n\ , • ■ • , rik): 1 < j < k + 1,1 < k <n,n> 
1} such that 2j=iPj',n( ri i5---5 n fe) = 1' f° r an y vector of positive integers (ni,...,nfe) 
with Y)j—i Tij =n, and 

k„ 

P[X n+1 e •\X 1 ,...,X n ] =PK n +i,n(ni,...,n Kn )Po(-) + J~]pj,n(ni, n,K„)Sx? (•), 

3 = 1 

where X\, . . . , X n is a sample with K n distinct values Xi, . . . , X^ . Statistical applica- 
tions involving species sampling models for different purposes than those of the present 
paper are provided, for example, in [24, 29, 30]. 

The Bayesian nonparametric approach we undertake postulates that the data are ex- 
changeable and generated by a species sampling model. Then, conditionally on the basic 

(n) 

sample of size n, inference is to be made on the number K, n of new distinct species that 
will be observed in the additional sample of size m. Interest lies in providing both a point 
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estimate and a measure of uncertainty, in the form of a credible interval, for Km^ given 
(Xi, . . . , X n ). Since the conditional distribution of Km ^ becomes intractable for large sizes 
m of the additional sample, one is led to studying its limiting behaviour as m increases. 
Such asymptotic results, in addition to providing useful approximations to the required 
estimators, are also of independent theoretical interest since they provide useful insight on 
the behaviour of the models we focus on. The only discrete random probability measure 
for which a conditional asymptotic result, similar to the one investigated in this paper, 
is known, is the two-parameter Poisson-Dirichlet process, shortly denoted as PD(er, 9). 
According to [32], a PD(cr, 9) process is a species sampling model characterized by 

9 + K n a rij-o- 

PK n +X,i{ n lT--> n K n ) = —Z-, , Pj,n(, n lT--> n Kj = -7- l 1 - 2 ) 

+ n 9 + n 

with j = 1, . . . , K n , a £ (0, 1) and 9 > —a. In this case, [10] provide a result describing the 
conditional limiting behaviour of Km ■ In the present paper, wc focus on an alternative 
species sampling model, termed normalized generalized gamma process in [24]. As we 
shall see in the next section, it depends on two parameters a € (0, 1) and j3 > and, for 
the sake of brevity, is denoted by NGG(cr, j3). Moreover, it is characterized by 

, , ®(-l) l P l ' <r T(K n + l-l/<r,l3) n „, 

, , , , ELo(?)(-l)'/3' /j r(K»-//a;/3) 
Pi n (ni, . . . , riK ) = (rij — a) H — r- (1-4) 

for any j G {1, . . . , K n }, where T(a; x) is the incomplete gamma function. The NGG(cr, 0) 
process prior has gained some attention in the Bayesian literature and it has proved to 
be useful for various applications such as those considered, for example, in [1, 2, 14- 
16, 24]. It is to be noted that the NGG(cr, 9) does not feature a posterior structure 
that is as tractable as the one associated to the PD(cr, 9) process (see, e.g., [5, 20, 26, 
32]). Nonetheless, in terms of practical implementation, it is possible to devise efficient 
simulation algorithms that allow for a full Bayesian analysis within models based on a 
NGG(cr,/3) prior. See [25] for a review of such algorithms. 

In the present manuscript, we will specify the asymptotic behaviour of K m , given the 
basic sample, as m diverges and highlight the interplay between the conditional distribu- 
tions of the PD(er, 9) and the NGG(<7, /3) processes. Since the posterior characterization 
of a NGG(er, /3) process is far more involved than the one associated to the PD(<r, 9) 
process, the derivation of the conditional asymptotic results considered in this paper is 
technically more challenging. This is quite interesting since it suggests that it is possible 
to study the limiting conditional behaviour of K m even beyond species sampling mod- 
els sharing some sort of conjugacy property For example, one might conjecture that the 
same asymptotic regime, up to certain transformations of the limiting random variable, 
should hold also for the wide class of Gibbs-type priors, to be recalled in Section 2. An up 
to date account of Bayesian Nonparametrics can be found in the monograph [18] and, in 
particular for asymptotic studies, [11] provides a review of asymptotics of nonparametric 
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models in terms of "frequentist consistency." Yet another type of asymptotic results are 
obtained in [8, 31]. 

The outline of the paper is as follows. In Section 2, one can find a basic introduction 
to species sampling models and a recollection of some results in the literature concerning 
the asymptotic behaviour of the number K n of distinct species in the basic sample, as 
n increases. Section 3 displays the main results, whereas the last section contains some 
concluding remarks. 



2. Species sampling models and Gibbs-type priors 

Let us start by providing a succinct description of completely random measures (CRM) 
before defining the specific models wc will consider and which can be derived as suitable 
transformations of CRMs. See [25] for an overview of discrete nonparametric models 
defined in terms of CRMs. 

Suppose ft is a random element defined on some probability space J^,P) and taking 
values on the space Mx of boundedly finite measures on (X, S") such that for any 
Ai,...,A n in 3£ , with A; n Aj = for i^j, the random variables fl(Ai), . . . , ft(A n ) 
are mutually independent. Then ft is termed completely random measure (CRM). It is 
well-known that the Laplace functional transform of ft has a simple representation of the 
type 

E [ e -//d/i] =e -^(/) ; 

where tp{f) = J R + xX [l — e~ s ^ y ^]v(ds,dy) for any measurable function /:X— > R such 
that J \ f\dft < oo almost surely and the measure v on R + x X is known as the Levy 
intensity of ft. See, for example, [21]. Since a CRM is almost surely discrete, any CRM 
can be represented as ft = X);>i Ji$Yi with independent random jump locations (ii)i>i 
and heights (Jj)i>i. For our purposes, it is enough to focus on the special case of v 
factorizing as ^(ds,da;) = pis) dsa(dcc), which implies independence of the locations Yi's 
and jumps Jj's in the above series representation. Furthermore, a can be taken to be 
nonatomic and finite, the latter ensuring almost sure finiteness of the corresponding 
CRM. Now, if card({ Ji : i > 1} D (0, e)) = p(s) ds = oo for any e > 0, one can define a 
random probability measure on X as 

(2.1) 



ft(X) 



This family of random probability measures is known from [19] as homogeneous nor- 
malized random measure with independent increments, a subclass of the general class of 
normalized processes introduced in [35]. Note that an X- valued exchangeable sequence 
(X n ) n >i generated by p as in (2.1) is a species sampling sequence. 

Here we focus on a specific example where the CRM defining p in (2.1) is the so-called 
generalized gamma process [3] that is characterized by 
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with a E (0, 1) and r > 0. In this case, 

lK/)= /[(/(.*) +T) CT -T>(dx) (2.2) 



for any measurable function / : X — > M. such that / \f\ a da < oo. In the sequel the model 
will be reparameterized, without loss of generality (see, e.g., [24, 33]), by setting j3 := 
t 17 and a as a probability measure. The corresponding CRM will be denoted by flo-.p- 
Henceforth, the random probability measure p obtained by normalizing fi a ,[3 as in (2.1) 
coincides, in distribution, with the NGG(cx, f3) process prior. An important special case 
arises when /3 = 0, since jl a fl reduces to the er-stable process, which plays a key role within 
the paper. For example, it is worth noting that fj, a ,p can also be defined as an exponential 
tilting of jlafl, for any j3 > 0. Specifically, if P a ,o is the probability distribution of firrfi on 
Aix and Po-,/3 is a probability measure on Mx that is absolutely continuous with respect 
to P CT o and such that 



( A1 )=exp{/3-/3 1 /V(X)} (2.3) 
(nr<r,o 

then P CTi( 3 coincides with the probability distribution of fi a ,p- In a similar fashion, one can 
also define the PD(<r, 9) process as a polynomial tilting of ft a .o, for any 9 > —a. Indeed, 
one introduces another probability measure Q a ,8 that is still absolutely continuous with 
respect to P CT> o and whose Radon-Nykodim derivative is 

dQo\0 , , Tje + i) e 

dP^ M = r ( ^ + i) [At(x)] (2 - 4) 

for any a E (0, 1) and 9 > —a. If /z* e is the random measure with probability distribution 
Q„ t e above, then p* = [i* a g/ g(X) coincides, in distribution, with a PD(cr, 9) process. 
See [34]. The different tilting structure featured by the normalized generalized gamma 
process and the two parameter Poisson-Dirichlet process will be reflected by the limiting 
results to be illustrated in the paper. 

It is also worth to recall that both the NGG(cr, (3) and the PD(cr, 6) processes can 
be seen as elements of the general class of Gibbs-type nonparametric priors introduced 
in [13]. Gibbs-type priors represent the most tractable subclass of species sampling mod- 
els. They are characterized by a parameter a < 1 and a collection of non-negative quan- 
tities {V rii k- n> 1,1 < k <n} that satisfy the forward recursive relations 

V n ,k = + (n— ka)V n+ i t k- 

These V^'s define the predictive weights that characterize a species sampling sequence 
governed by a Gibbs-type prior. Indeed, one has 

, s Vn+X,K n +l , v V n +l,K n , \ fn e\ 

PK n +i,n\nx,...,n K J = — 77 , ft>(ni,---,nif„) = — \Jij-a) (2.5) 

V n ,K„ 'n,K„ 
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for each j £ {1, . . . ,K n }. The fundamental simplification involved in (2.5) is that the 
probability of observing a "new" or an "old" species depend on the sample size and on 
the number of already observed distinct species but not on their frequencies: this crucially 
simplifies explicit calculations. Turning to the two specific processes introduced before, 
in accordance with (1.2), the PD(cr, 9) process identifies a Gibbs-type prior with 

v _ nS(g+^) 

"' k ~ (<? + l)n-l ' 

whereas, in accordance with (1.3) and (1.4), a NGG(er, f3) prior is also of Gibbs-type with 
a £ (0, 1) and 

•"■wKv 1 )'-^-?')- 

As shown in [27], a normalized CRM is a Gibbs-type prior (with a £ (0, 1)) if and only 
if it is a NGG(<7, 0) process. This result also motivates the focus of the paper on the 
NGG(cr,/3) process, which clearly has a prominent role. 

The result on the limiting behaviour of Km ^ to be determined in the next section 
parallels known results for the unconditional case where one aims at determining the 
asymptotics of K n as the sample size n increases and connects to the conditional asymp- 
totics displayed in [10] for the PD(cr, 9) process. In order to describe the result for the 
unconditional case, let T a $ := /v,o(X) be the random total mass of a cr-stable CRM and 
denote by f a its density function which satisfies J Q e~ As / (T (s) ds = c~ A for any A > 0. 
Moreover, let T a ^ := fl ai p(K.) be the random total mass of NGG(er, /?) process and recall 
that its law can be obtained by exponentially tilting the probability distribution of T ay0 
as in (2.3). In particular, if 

Sa,p=T~J, (2.6) 
then its density function, with respect to the Lebesgue measure on R, coincides with 



and one has that 

Kn 



a . s . 



S„,p. (2.7) 



According to the terminology introduced by [33], the random variable S a ,p is the so- 
called a- diversity of the exchangeable random partition induced by a NGG(ct, /3) process 
prior. See also Definition 3.10 in Pitman [34]. Note that a similar result holds true for the 

PD(cr, 9) process. Indeed, if T' a g = /i* e (X) so that its probability distribution is obtained 
by polynomially tilting the probability distribution of X^o as in (2.4) and 

S' a , g = {T')-° (2.8) 
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admits density function 

T(e + i) s /*- 1 /*- 1 



Then one has 



^^S'^ e . (2.9) 

See [34], Theorem 3.8. These results are somehow in line with the fact that the NGG(<7, 0) 
and the PD(tr, 9) processes are distributionally equivalent to normalized random mea- 
sures that are obtained by an exponential and a polynomial tilting, respectively, of a 
er-stable CRM as highlighted in (2.3) and in (2.4). Finally, note that a combination of 
[13], Theorem 12, and [33], Proposition 13, shows that the unconditional asymptotic 
results in (2.7) and (2.9) can be extended to the whole class of Gibbs-type priors. See 
also [17] for another contribution at the interface between Bayesian Nonparametrics and 
Gibbs-type random partitions. 



3. Asymptotics of K$ with a NGG(<r,/3) process 

As mentioned before, inference on is of great importance since it provides a measure 
of species richness of a community of plants/animals or of a cDNA library for gene 
discovery. The key quantity for obtaining posterior inferences is given by the probability 
distribution P[Km = k\X\, . . . , X n ] for k = 0, . . . , m. By virtue of predictive sufficiency 
of the number K n of distinct species observed among the first n data Xi,..., X n , in [23] 
it has been shown that in the NGG (<J,/3) this distribution coincides with 

Pt j) (k) :=P[Kt ) = k\K n =j) 

(3.1) 

g(m, fc; a, -n + jo) EST 1 ("+7' 1 ) (-l)'/3' /g r(j + fc I /a; 0) 

for k = 0, . . . ,m, with $#(n, fc; s,r) denoting the non-central generalized factorial coeffi- 
cient. See [7] for a comprehensive account on generalized factorial coefficients. Expression 
(3.1) can be interpreted as the "posterior" probability distribution of the number of dis- 
tinct new species to be observed in a further sample of size m. Now, based on (3.1), one 
obtains the expected number of new species as 

m 

Et 3) :=E[K^\K n =j}=J2kP^' j \k), (3.2) 

k=0 

which corresponds to the Bayes estimator of under quadratic loss. Moreover, a 

measure of uncertainty of the point estimate Em can be obtained in terms of a- 
credible intervals that is, by determining an interval (21,22) with Z\ < zi such that 
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P[zi < Km < Z2\K n = j] > a. The interval (21,22) of shortest length is then typically 
referred to as highest posterior density interval. 

The main advantage of the distribution (3.1) is that it is explicit. However, since the 
sum of incomplete gamma functions cannot be further simplified, its computation can 
become overwhelming even for moderately large sizes of n and m. This fact represents 
a major problem in the frequent practical situations in which the size of the additional 
sample of interest is large. For instance, in genomic applications one has to deal with 
relevant portions of cDNA libraries which typically consist of millions of genes. Hence, it 

(n) 

is natural to study the asymptotics for K m , given K n , as m — > +00, in order to obtain 
approximations of (3.1) and, consequently, also of (3.2) and of the corresponding highest 

(n) 

posterior density intervals. Indeed, if one is able to show that a suitable rcscaling of K m , 
given K n , converges in law to some random variable, one can use the probability distri- 
bution of this limiting random quantity in order to derive the desired approximations. 



3.1. Asymptotic distribution 

The statement of the main result in the paper involves a positive random variable Y q 
whose density function is, for any q > 0, 

fy q (y) = ^f^\y q - 1 - 1 ^fAy- 1/ n 

aT(q + l) 

and we B a ,b to denote a beta random variable with parameters (a, b). Moreover, set S n j — 
Bj^n/cr-jYn/cr, with Bj_ n j a _j and Y n i a independent, and denote by gs nj the density 
function of S n j. 

Theorem 1. If (X n ) n >i is a species sampling sequence directed by a NGG(cr, 0) process 
prior, conditional on K n = j one has 



K {n) 
Jim 



Z n _j a.s. (3-3) 



as m — 5> +oo, where Z n j is a positive random variable obtained by exponentially tilting 
the density function of S n j , namely 



r(j)e-W'r" gSnii (z) 



Y^{i)(-W llo ?{j-il°\f}) 

Proof. The first part of the proof exploits a martingale convergence theorem along the 
same lines of [34], Theorem 3.8. In particular, let us start by computing the likelihood 
ratio 



M (n) _ a.p 



# 5 
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where &m = &{X n+ i, . . . ,X„+ m ), F^l is the conditional probability distribution of a 
normalized generalized gamma process with parameter (er, f3) given K n and, by virtue of 
[26], Proposition 1, 



iff* 1 ) Y~ vra + m— 1 (n+m— I s ! 



(-i)'/3'/ CT r(ic r 



(n) r 



for any integer K n > 1 and 



«S(^ B) ) 



(*») 



A" 



(») 



(n) r 



Hence, (M 

(n) 

tT,j3,m 



(») 



im r^m )m>l 1S a 



c(™) 

(7,0 ' 



rem, M 



has a 



j(") 
er,0 



martingale and by a martingale convergence theo- 
almost sure limit, say M: i, as m — > +oo. Clearly, we have that 



E^q[M^] = 1, where E^q denotes the expected value with respect to u 



r(«) 



00 



Let now 



(E n ) n >i be a sequence of i.i.d. random variables having a negative exponential distri- 
bution with parameter 1. Moreover, suppose the E n 's arc independent of (K n , Km ). 



Set #m™^ := EfcTi"^" 1 an d note that, conditionally on (K n , Km'), S'm > has gamma 



»(n) 



distribution with expected value K n + K m ^ . We can then rewrite p 



An) 



as follows 



M 



(n) 



r(K n ) 



i 



E^o (^(-lypVTiKn - l/<r t P) T(K n + Kg>) 

n+m— 1 



E 

(=0 



n + m— 1 
I 

r(K n ) 



/ y 



K n +K^> -i/tr-l - : y 



d// 







Er=o (VK- 1 )'^^" - V^;/?) r(K n + KW) 



+oo 



y K n +K^-l c -y 



r{K n ) 



i - 



ft 1 



l/a \ n+m-1 



l/a 



dy 



3l/cr \ n+m+1 



1 



From the strong law of large numbers, Sm^ j(K n + Km ) — ► 1 as m — > +oo and condi- 
tionally on (-Kn, Km ^). Using the dominated convergence theorem, we have 
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ol/cr \ n+m—l 

x II- 



((K n + K»)(g*/(K n + X" )))V" , 

r(K n ) ( p 1 '" 



eiTo 1 {^(-ly^nKn-i/a^) \ [K n +Kg>yi 

r(K n ) r p 1 ' 

■ exp< —m- 



") . M (n) 



as m — > +00. Since M^l m -> almost surely (with respect to P^g), then there 



exists some positive random variable, say L a ^ n such that mj '(Km^ 1 ) 1 ^ — > L a ^ n almost 

surely (with respect to P^o)- ] 
note that it must be such that 



surely (with respect to P^o)- In order to identify the probability distribution of L a _ n , 



Since S„ t K n = B Kntn i a _ Kn Y n i a , we have to prove that L a;n = S n % , that is, that the 
density function of L am coincides with 

jL " A) T{K n )T{n/a-K n ) 

ftosi /■y-°\ Kn ~ 1 / -cr\n/(T-K n -l 



v \ v 1 \ v 



So we simply have to show that the Laplace transform of the density function in (3.5) is 
given by (3.4). By a simple change of variable, x = v~ 1 / a , the previous density reduces to 



T(K n )T(n/a-K n ) J Juy '\x-°) \ x 

and by the change of variable y = z~ a /x~ a 

h,Jz) = V(K ^ ] K , z~ n f y-^/^-\l yy^-^Uzy 1 ^) dy. 
T{K n )Y{n/(j - K n ) J 

The Laplace transform of n is then given by 
J T(K n )T(n/a - K n ) 

x / 1 y-"/ ff + 1 / ff + A '"- 1 (l-j/)"/- A '"- 1 / CT (^ 1/a )dydz 



T(K n )T{n/a - K n ) J 
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CTr (") f 1 y -n/a+l/a+K„-lM _ y yi/cr-K n -l 

Jo 

y K n -l( 1 _ y) n/*-K n -l 

x / e Wv) v " h h- n f a (h)dhdy 



aT(n) 
T(K n )T{n/a-K n ) J 



r(n/a) 
T(K n )T(n/a - K n ) 



V»- 1 (l-y) n/ '-^- 1 
r ( ? V cr ) Jo 



According to the well-known gamma identity, we can write 

T(n/a) J Q h n Hn/cr) 7 J 

obtaining 

r ("/ CT ) f y K n -l( 1 _ y Xn/a-K n -l 



T(K n )T(n/a-K n ) J 



. + OO /* + 00 

x — / / e -^'^/ ff (A)dfcd« 

f yKn-lf 1 _ y \n/<T-K n -l 



Y{K n )Y{n/a-K n ) J 



x 



Hn/a) J 
T(K n )T(n/a-K n ) J 



I ~n/o-X I 1 IP 



1 



1- — c" z dzdy 



T(K n )T(n/a-K n ) 



x / u;^- 1 ^-^)*/'-*-- 1 !!- ( 1) 1/<T Y 'd^dz 
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T{K n )T{n/a-K n ) J p 



which corresponds to (3.4). Finally, since the probability measures Pl n ^. and P ™^ are 

mutually absolutely continuous, almost sure convergence holds true with respect to P^, 

as well. In order to deduce the P^-law of Z n jt n , it is sufficient to exploit a change of 
measure suggested by 



p(n) M x_ / dv (n) 



and by the fact that 



This completes the proof. □ 



It is worth stressing that the limit random variable in the conditional case is the same 
as in the unconditional case but with updated parameters and a rescaling induced by a 
beta-distributed random variable. The density of Z n _j in (3.3) can be formally represented 
as 



T(j)T(n/a-j) 

where we recall that f a is the density function of a positive stable random variable and, 
then, coincides with a density function of the random total mass of a c-stable CRM 
T a .a '■= £L a fi(K). Theorem 1 can be compared with an analogous result recently obtained 
in [10], Proposition 2, for the PD(tr, 9) process, where it is shown that the number of new 
distinct species K$ induced by the PD(<r,0) process is such that 



Z'i (3-7) 



as m —> +oo. where Z' n ■ = Bj +0 / (y n / (y _ : jY( g+n y a and the random variables Bj +e / cr n / (J _j 
and V(0+n)/cr are independent. This can be paralleled with the unconditional limit since 
it is known that Kn/n 17 — > Yg/ a , almost surely, as n — > oo. See, for example, [34], Theo- 
rem 3.8. 
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Remark. Note that a normalized cr-stable process coincides, in distribution, with both 
a NGG(er, 0) and a PD(cr, 0) process. Hence, it is no surprise that the two limits (3.3) and 
(3.7) are the same, in distribution, when (3 = 6 = 0. Another interesting case is represented 
by the normalized generalized gamma process with parameter (1/2,/?) which yield to the 
so-called normalized inverse- Gaussian processes [22]. In particular, for the NGG(l/2,/3) 
process the density /i/ 2 in (3.6) is known explicitly and the previous expression can be 
simplified to 



fz n Az) 



e -^/ z > 1/g rH^-y/ 2 - 1 

Ero 1 (V)Fi)^r(i - iW) ^r(2n^j) 



2n-j-l 

X 

1=0 



£ (*'7- 1 )(-.)Wrf- '- 



3.2. Sampling from the limiting random variable 

Since the above described limiting distributions cannot be easily handled for practical 
purposes, it is useful to devise a simulation algorithm. In this respect, one can adapt, 
similarly to [28] , an exact sampling algorithm recently devised by [9] for random variate 
generation from polynomially and exponentially tilted cr-stable distributions. This will al- 
low to sample the limiting random variables Z' n • and Z n j corresponding to the PD(ct, 9) 
and to the NGG(cr, (3) case, respectively. Indeed, note that Z' n - is a scale mixture involv- 
ing a beta random variable Bj + Qi atn i a _j and a positive random variable Y(a +n y a . The 

latter is such that its transformation Y, a , a sl admits density function of the form 

fy-v° (y) = T vS^7 t ^^^(y)i(o,oo ) (y), (3.8) 

r (9+«)/a 1 ((0 + n)/a + 1) 

which is precisely the density function of a polynomially tilted cr-stable distribution. 
Therefore, random variate generation from Z' n j can be easily done by independently 
sampling from a beta random variable with parameter (j + — j) and from a 

random variable with density function (3.8) by means of the algorithm devised in [9]. 
We refer to [10] for an alternative sampling algorithm for Z' n j via augmentation. Similar 
arguments can be applied in order to sample from the limit random variable Z n _j. Indeed, 
observe that Z n j is characterized by a density function proportional to 

\V 



d 



with gs n j being the density function of the random variable S n j = Bj n / a _jY n / a . There- 
fore, in order to sample from the distribution of Z n j one can apply a simple rejection 
sampling. In particular, the sampling scheme would work as follows 



(1) Generate B ~ Bj n /a-j- 
-V' 



(2) Sample Y ~ Y V° according to Devroye's algorithm. 
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(3) Set S = BY-' y . 

(4) Sample U from a uniform on the interval (0, 1). 
(4.a) If U < exp{-(£/ 1 S*) 1/<T } set Z = S. 

(4.b) HU> exp{-(/3/5) 1 /' 7 } restart from (1). 



3.3. Interpretation of asymptotic quantities 

In this final section, we provide a result that gives an interesting representation of the key 
random variable L a , n = S n X j a . To this end, we need to provide a representation for the 
posterior Laplace transform of the total mass of the cr-stable CRM fj, a ,o or, equivalently, 
of the unnormalized NGG(er, 0) or PD(ct, 0) processes. Indeed one has 

Proposition 1. Let (Xi)i>i be a species sampling sequence directed by a normalized 
a -stable process prior and suppose that the sample X±, . . . ,X n is such that K n = j . Then 



1 r°° { l 1 /' T \ n_1 
E [e-VA..oW| Xlj ... jXn]= jf j e -v dy (3 . 9) 

for any A > 0. 



Proof. Set T CT) o = /v.o (X) . Since the joint distribution of (K „ , JVi, . . . , Nk „ ) , also known 
as exchangeable partition probability function (see [34]), of a normalized cr-stable process 
coincides with P[(K n ,Ni,. ..,N Kn ) = (k,m, . . .,n k )} = a^ l T{j) Y[ k l=1 {l - a) ni _i/I», 



one has 

E[e- AT ^|Xi,...,X 



r(n) 1 



x f°° u^e-^+^a^ TT V[ ^~ (T) (u + X)- n *+° du 
Jo ti T ^-^ 

and a simple change of variable (u + A) CT = y yields the representation in (3.9). □ 

Proposition 1 allows one to draw an interesting comparison between unconditional 
and conditional limits of the number of distinct species. As we have already highlighted 
in Section 2, the probability distribution of the er-diversities for the NGG(cr, (3) process 
and the PD(<r, 9) process arise as a power transformation (involving the parameter a) 
of a suitable tilting of the probability distribution of T CT .o '■= /v.o(^)- We are now in the 
position to show that a similar structure carries over when one deals with the conditional 
case. Resorting to the notation set forth in Theorem 1, let T CTi o.if„ to be a random variable 
whose law coincides with the probability distribution of the conditional total mass T& q 
of a cr-stable process given a sample of size n containing K n distinct species. Hence, from 
the Laplace transform (3.4) in the proof of Theorem 1 one can easily spot the following 
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identity 

L a ,n=T<jfl^K n - (3.10) 

Let now P^ q and Pj^ ^ be the conditional probability distributions of, respectively, the 
cr-stable /v.o and the generalized gamma jl^p processes. According to Theorem 1, the 

probability distributions W a H q and P^"g are mutually absolutely continuous giving rise to 
the conditional counterpart of the identity (2.3), that is, 



dl 



o(«) 



M = ^n-wn-n, fff,,.^,- T, ^ expi-/? 1 / V (X)} (3.11) 



dP^ E^o 1 (^(-ly^Tti ~ l/<r,P) 



for any a £ (0, 1) and /3 > 0. In particular, if we denote by T^^^k^ the random vari- 
able whose probability distribution is obtained by exponentially tilting the probability 
distribution of T a $^K n as in (3.11), then one can establish that 

Znj = (T^kX*- (3-12) 

In other terms, one can easily verify that the probability distribution of the limit random 
variable Z n j in (3.3) can be also derived by applying to the probability distribution of 
Ta-,/3.K„ the same transformation characterizing the corresponding unconditional case. In 
a similar fashion, one can also derive the conditional counterpart of the identity (2.4) for 
the two parameter Poisson-Dirichlct process. Indeed, according to [10], Proposition 2, 

one can introduce a probability measure Q^™i on Afx whose Radon-Nikodym derivative 

(n) 

with respect to the dominating measure P^ q is given by 

dp W T(n)T{9/o- + j) 

for any a £ (0, 1) and 9 > — a with Q^"2 being the probability measure of the random 
measure \x* a g conditional on the sample. In particular, if we denote by T' e K the random 
variable whose probability distribution is obtained by polynomially tilting the probability 
distribution of T a Q K n as in (3.13), then one can easily verify that 

Z' nj = {T' a ^ K X a - (3.14) 

This suggests that the probability distribution of the limiting random variable Z' n ^ in 
(3.7) can also be derived by applying to the probability distribution of T' a s K the same 
transformation characterizing the corresponding unconditional case. 



4. Concluding remarks 

The identities (3.12) and (3.14) represent the conditional counterparts of the identities 
(2.6) and (2.8), respectively, given a sample containing K n distinct species. Hence, in the 
same spirit of [33], Proposition 13, we have provided a characterization of the distribu- 
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tion of the limiting random variables Z n j and Z' n ^ in terms of a power transformation 
(involving the parameter er) applied to a suitable tilting for the conditional distribution 
of the total mass of the a-stable process. In particular, the identities (3.12) and (3.14) 
characterize the distribution of the limit random variables Z n j and Z' n ^ via the same 
transformation characterizing the unconditional case and applied to an exponential tilt- 
ing and polynomial tilting, respectively, for a scale-mixture distribution involving the 
beta distribution and the er-stablc distribution. To conclude, there is a connection be- 
tween the prior, and posterior, total mass of a a-stable CRM that we conjecture can 
be extended to any Gibbs-type random probability measure and will be object of future 
research. 

Acknowledgements 

The authors are grateful to an Associate Editor and a Referee for valuable remarks and 
suggestions that have lead to a substantial improvement in the presentation. This work 
is partially supported by MIUR, Grant 2008MK3AFZ, and Regione Piemontc. 

References 

[1] Argiento, R., Guglielmi, A. and Pievatolo, A. (2009). A comparison of nonparametric 
priors in hierarchical mixture modelling for AFT regression. J. Statist. Plann. Inference 
139 3989-4005. MR2558344 

[2] Argiento, R., Guglielmi, A. and Pievatolo, A. (2010). Bayesian density estimation 
and model selection using nonparametric hierarchical mixtures. C'omput. Statist. Data 
Anal. 54 816-832. MR2580918 

[3] Brix, A. (1999). Generalized gamma measures and shot-noise Cox processes. Adv. in Appl. 
Probab. 31 929-953. MR1747450 

[4] Bunge, J. and Fitzpatrick, M. (1993). Estimating the number of species: A review. 
J. Am. Statist. Assoc. 88 364-373. 

[5] Carlton, M.A. (2002). A family of densities derived from the three-parameter Dirichlet 
process. J. Appl. Probab. 39 764-774. MR1938169 

[6] Chao, A. (2005). Species estimation and applications. In Encyclopedia of Statistical Sci- 
ences (N. Balakrishnan, C.B. Read and B. Vidakovic, eds.) 12 7907-7916. New 
York: Wiley. 

[7] Charalambides, C.A. (2005). Combinatorial Methods in Discrete Distributions. Hoboken, 
NJ: Wiley. MR2131068 

[8] De Blasi, P., Peccati, G. and Prunster, I. (2009). Asymptotics for posterior hazards. 

Ann. Statist. 37 1906-1945. MR2533475 
[9] Devroye, L. (2009). Random variate generation for exponentially and polynomially tilted 

stable distributions. ACM Trans. Model. Comp. Simul. 19 4. 
[10] Favaro, S., Lijoi, A., Mena, R.H. and Prunster, I. (2009). Bayesian non-parametric 

inference for species variety with a two-parameter Poisson-Dirichlet process prior. 

J. R. Stat. Soc. Ser. B Stat. Methodol. 71 993-1008. MR2750254 



Asymptotics for species variety 



17 



[11] Ghosal, S. (2010). The Dirichlet process, related priors and posterior asymptotics. In 
Bayesian Nonparametrics (N.L. Hjort, C.C. Holmes, P. Muller and S.G. Walker, 
eds.) 35-79. Cambridge: Cambridge Univ. Press. MR2730660 

[12] Gnedin, A. (2010). A species sampling model with finitely many types. Electron. Commun. 
Probab. 15 79-88. MR2606505 

[13] Gnedin, A. and Pitman, J. (2005). Exchangeable Gibbs partitions and Stirling trian- 
gles. Zap. Nauchn. Sem. S.-Peterburg. Otdel. Mat. Inst. Steklov. (POMI) 325 83-102. 
MR2160320 

[14] Griffin, J., Kolossiatis, M. and Steel, M. F. J. (2010). Modelling overdispersion with 
the normalized tempered stable distribution. Working Paper 10-01, CRiSM. 

[15] Griffin, J., Kolossiatis, M. and Steel, M. F. J. (2010). Comparing distributions using 
dependent normalized random measure mixtures. Working Paper 10-24, CRiSM. 

[16] Griffin, J.E. and Walker, S.G. (2011). Posterior simulation of normalized random mea- 
sure mixtures. J. Comput. Graph. Statist. 20 241-259. MR2816547 

[17] Griffiths, R.C. and Spano, D. (2007). Record indices and age-ordered frequencies in 
exchangeable Gibbs partitions. Electron. J. Probab. 12 1101-1130. MR2336601 

[18] Hjort, N.L., Holmes, C.C, Muller P. and Walker S.G. (2010). Bayesian Nonpara- 
metrics. Cambridge: Cambridge Univ. Press. MR2722987 

[19] James, L.F., LlJOI, A. and Prunster, I. (2006). Conjugacy as a distinctive feature of the 
Dirichlet process. Scand. J. Statist. 33 105-120. MR2255112 

[20] James, L.F., LlJOI, A. and Prunster, I. (2009). Posterior analysis for normalized random 
measures with independent increments. Scand. J. Stat. 36 76-97. MR2508332 

[21] Kingman, J.F.C. (1993). Poisson Processes. Oxford Studies in Probability 3. New York: 
Oxford Univ. Press. MR1 207584 

[22] Luoi, A., Mena, R.H. and Prunster, I. (2005). Hierarchical mixture modeling with nor- 
malized inverse-Gaussian priors. J. Amer. Statist. Assoc. 100 1278-1291. MR2236441 

[23] Luoi, A., Mena, R.H. and Prunster, I. (2007). Bayesian nonparametric estimation of 
the probability of discovering new species. Biometrika 94 769-786. MR2416792 

[24] Luoi, A., Mena, R.H. and Prunster, I. (2007). Controlling the reinforcement in Bayesian 
non-parametric mixture models. J. R. Stat. Soc. Ser. B Stat. Methodol. 69 715-740. 
MR2370077 

[25] Luoi, A. and Prunster, I. (2010). Models beyond the Dirichlet process. In Bayesian 
Nonparametrics (N.L. Hjort, C.C. Holmes, P. Muller and S.G. Walker, eds.) 
80-136. Cambridge: Cambridge Univ. Press. MR2730661 

[26] LlJOI, A., Prunster, I. and Walker, S.G. (2008). Bayesian nonparametric estimators de- 
rived from conditional Gibbs structures. Ann. Appl. Probab. 18 1519-1547. MR2434179 

[27] Luoi, A., Prunster, I. and Walker, S.G. (2008). Investigating nonparametric priors 
with Gibbs structure. Statist. Sinica 18 1653-1668. MR2469329 

[28] Montagna, S. (2009). Random probability measures and their applications to Bayesian 
Nonparametrics. M.Sc. thesis, Collegio Carlo Alberto, Moncalieri, Italy. 

[29] Navarrete, C. and Quintana, F.A. (2011). Similarity analysis in Bayesian random par- 
tition models. Comput. Statist. Data Anal. 55 97-109. 

[30] Navarrete, C, Quintana, F.A. and Muller, P. (2008). Some issues in nonparametric 
Bayesian modelling using species sampling models. Stat. Model. 8 3-21. MR2750628 

[31] Peccati, G. and Prunster, I. (2008). Linear and quadratic functionals of random hazard 
rates: An asymptotic analysis. Ann. Appl. Probab. 18 1910-1943. MR2462554 

[32] Pitman, J. (1996). Some developments of the Blackwell-MacQueen urn scheme. In Statis- 
tics, Probability and Game Theory (T.S. Ferguson, L.S. Shapley and J.B. Mac- 



18 



S. Favaro, A. Lijoi and I. Priinster 



Queen, eds.). Institute of Mathematical Statistics Lecture Notes — Monograph Series 

30 245-267. Hayward, CA: IMS. MR1481784 
[33] Pitman, J. (2003). Poisson-Kingman partitions. In Statistics and Science: A Festschrift 

for Terry Speed (D.R. Goldstein, ed.). Institute of Mathematical Statistics Lecture 

Notes— Monograph Series 40 1-34. Beachwood, OH: IMS. MR2004330 
[34] Pitman, J. (2006). Combinatorial Stochastic Processes. Lecture Notes in Math. 1875. 

Berlin: Springer. MR2245368 
[35] Regazzini, E., Lijoi, A. and Prunster, I. (2003). Distributional results for means of 

normalized random measures with independent increments. Ann. Statist. 31 560-585. 

MR1983542 

Received August 2010 and revised January 2011 



